Labeling Gaps Between Words: Recognizing Overlapping Mentions with Mention Separators

نویسندگان

  • Aldrian Obaja Muis
  • Wei Lu
چکیده

In this paper, we propose a new model that is capable of recognizing overlapping mentions. We introduce a novel notion of mention separators that can be effectively used to capture how mentions overlap with one another. On top of a novel multigraph representation that we introduce, we show that efficient and exact inference can still be performed. We present some theoretical analysis on the differences between our model and a recently proposed model for recognizing overlapping mentions, and discuss the possible implications of the differences. Through extensive empirical analysis on standard datasets, we demonstrate the effectiveness of our approach.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Supplementary Material for Labeling Gaps Between Words: Recognizing Overlapping Mentions with Mention Separators

This is the supplementary material for “Labeling Gaps Between Words: Recognizing Overlapping Mentions with Mention Separators” (Muis and Lu, 2017). This material explains in more depth the issue of spurious structures and also the experiments settings. 1 Details on Spurious Structures About mention hypergraph, we remarked in Section 3.1 that the normalization term calculated by the forward-back...

متن کامل

A Joint Framework for Coreference Resolution and Mention Head Detection

In coreference resolution, a fair amount of research treats mention detection as a preprocessed step and focuses on developing algorithms for clustering coreferred mentions. However, there are significant gaps between the performance on gold mentions and the performance on the real problem, when mentions are predicted from raw text via an imperfect Mention Detection (MD) module. Motivated by th...

متن کامل

Variability in word duration as a function of probability, speech style, and prosody.

This article examines how probability (lexical frequency and previous mention), speech style, and prosody affect word duration, and how these factors interact. Participants read controlled materials in clear and plain speech styles. As expected, more probable words (higher frequencies and clear speech second mentions) were significantly shorter than less probable words, and lexical frequency wo...

متن کامل

Joint Mention Extraction and Classification with Mention Hypergraphs

We present a novel model for the task of joint mention extraction and classification. Unlike existing approaches, our model is able to effectively capture overlapping mentions with unbounded lengths. The model is highly scalable, with a time complexity that is linear in the number of words in the input sentence and linear in the number of possible mention classes. Our model can be extended to a...

متن کامل

Recognizing Biomedical Named Entities Using Skip-Chain Conditional Random Fields

Linear-chain Conditional Random Fields (CRF) has been applied to perform the Named Entity Recognition (NER) task in many biomedical text mining and information extraction systems. However, the linear-chain CRF cannot capture long distance dependency, which is very common in the biomedical literature. In this paper, we propose a novel study of capturing such long distance dependency by defining ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017